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(57) Abstract: The early prediction 
or diagnosis of sepsis advantageously 
allows for clinical intervention before the 
disease rapidly progresses beyond initial 
stages to the more severe stages, such 
as severe sepsis or septic shock, which 
are associated with high mortality. Early 
prediction or diagnosis is accomplished 
using a molecular diagnostics approach, 
involving comparing an individual's 
profile of biomarker expression to profiles 
obtained from one or more control, 
or reference, populations, which may 
include a population that develops sepsis. 
Recognition of features in the individual's 
biomaiicer profile that are characteristic 
of the onset of sepsis allows a clinician to 
diagnose the onset of sepsis from a bodily 
fluid isolated at the individual at a single 
point in time. The necessity of monitoring 
the patient over a period of time is, 
therefore, avoided, advantageously 
allowing clinical intervention before the 
onset of serious symptoms of sepsis. 
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DIAGNOSIS OF SEPSIS OR SIRS USING BIOMARKER PROFILES 



[0001] The present application claims priority to United States Provisional Patent 
Application Serial No. 60/425^322, filed November 12, 2002, which is herein incorporated by 
reference in its entirety. 

FIELD OF THE mVENTION 
[0002] The present invention relates to methods of diagnosing or predicting sepsis or its 
stages of progression in an individual. The present invention also relates to methods of 
diagnosing systemic inflammatory response syndrome in an individual. 

B ACKGROTTNn OF TTTO INVENTION 
[0003] Early detection of a disease condition typically allows for a more effective 
therapeutic treatment with a correspondingly more favorable clinical outcome. In many 
cases^ however, early detection of disease symptoms is problematic; hence, a disease may 
become relatively advanced before diagnosis is possible. Systemic inflammatory conditions 
represent one such class of diseases. These conditions, particularly sepsis, typically result 
from an interaction between a pathogenic microorganism and the host's defense system that 
triggers an excessive and dysregulated inflammatory response in the host The complexity of 
the host's response during the systemic inflammatory response has complicated efforts 
towards understanding disease pathogenesis. (Reviewed in Healy, Annul Pharmacother, 36: 
648-54 (2002).) An incomplete understanding of the disease pathogenesis, in turn, 
contributes to the difficulty in jSnding diagnostic biomarkers. Early and reliable diagnosis is 
imperative, however, because of the remarkably rapid progression of sepsis into a life- 
threatening condition. 

[0004] Sepsis follows a well-described time course, progressing from systemic 
inflammatory response syndrome ("SIRS")-negative to SIRS-positive to sepsis, which may 
then progress to severe sepsis, septic shock, multiple organ dysftinction ("MOD"), and 
ultimately death. Sepsis also may arise in an infected individual Mdien the individual 
subsequently develops SIRS. "SIRS" is commonly defined as the presence of two or more of 
the following parameters: body temperature greater than 38*^0 or less than 36°C; heart rate 
greater than 90 beats per minute; respiratory rate greater than 20 breaths per minute; Pco2 less 
than 32 mm Hg; and a white blood cell count either less than 4.0x10^ ceUs/L or greater than 
12.0x10^ ceUs/L, or having greater than 10% immature band forms. "Sepsis" is commonly 
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defined as SIRS with a coBfinned infectious process. "Severe sepsis" is associated with 
MOD, hypotension, disseminated intravascular coagulation ("DIG") or hypoperfiision 
abnormalities, including lactic acidosis, oliguria, and changes ia mental status. "Septic 
shock" is commonly defined as sepsis-induced hypotension that is resistant to fluid 
resuscitation with the additional presence of hypoperfusion abnormalities. 

[0005] Documentkig the presence of the pathogenic microorganisms clinically significant 
to sepsis has proven difficult. Causative microorganisms typically are detected by culturing a 
patient's blood, sputum, urine, wound secretion, in-dwelling line catheter surfaces, etc. 
Causative microorganisms, however, may reside only ia certain body microenvironments 
such that the particular material that is cultured may not contain the contaminating 
microorganisms. Detection may be complicated further by low numbers of microorganisms 
at the site of infection. Low numbers of pathogens in blood present a particular problem for 
diagnosing sepsis by culturing blood. In one study, for example, positive culture results were 
obtained in only 17% of patients presenting clinical manifestations of sepsis. (Rangel- 
Frausto et ah, JAMA 273: 1 17-23 (1995).) Diagnosis can be further complicated by 
contamination of samples by non-pathogenic microorganisms. For example, only 12.4% of 
detected microorganisms were clinically significant in a study of 707 patients with 
septicemia. (Weinstein et aL, Clinical Infectious Diseases 24: 584-602 (1997).) 

[0006] The difficulty in early diagnosis of sepsis is reflected by the high morbidity and 
mortality associated with the disease. Sepsis currently is the tenth leading cause of death in 
the United States and is especially prevalent among hospitalized patients in non-coronary 
intensive care units (ICUs), where it is the most common cause of death. The overall rate of 
mortality is as high as 35%, with an estimated 750,000 cases per year occurring in the United 
States alone. The aimual cost to treat sepsis in the United States alone is in the order of 
billions of dollars. 

[0007] A need, therefore, exists for a method of diagnosing sepsis sufficiently early to 
allow ejBfective intervention and prevention. Most existing sepsis scoring systems or 
predictive models predict only the risk of late-stage complications, including death, in 
patients who already are considered septic. Such systems and models, however, do not 
predict the development of sepsis itself. What is particularly needed is a way to categorize 
those patients with SIRS who will or will not develop sepsis. Currentiy, researchers will 
typically define a single biomarker that is caressed at a diJBferent level in a group of septic 
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patients versus a normal (i.e., non~septic) control group of patients. United States Patent 
Application Serial No. 10/400,275, filed March 26, 2003, the entire contents of which are 
hereby incorporated by reference, discloses a method of indicating early sepsis by analyzing 
time-dependent changes in the expression level of various biomarkers. Accordingly, optimal 
methods of diagnosing early sepsis currently require both measuring a plurality of biomarkers 
and monitoring the expression of these biomarkers over a period of time. 

[0008] There is a continuing urgent need in the art to diagnose sepsis with specificity and 
sensitivity, without the need for monitoring a patient over time. Ideally, diagnosis would be 
made by a technique that accurately, rapidly, and simultaneously measures a plurality of 
biomarkers at a single point in time, thereby minimizing disease progression during the time 
required for diagnosis. 

SUMMARY OF THE INVENTION 
[0009] The present invention allows for accurate, rapid, and sensitive prediction and 
diagnosis of sepsis through a measurement of more than one biomarker taken from a 
biological sample at a single point in time. This is accomplished by a molecular diagnostics 
approach, in which a biomarker profile is obtained at a single point in time from an 
individual, particularly an individual at risk of developing sepsis, having sepsis, or suspected 
of having sepsis, and comparing the biomarker profile firom the individual to a reference 
biomarker profile. The reference biomarker profile may be obtained from a population of 
individuals (a "reference population") who are, for example, afflicted with sepsis or who are 
suffering firom either the onset of sepsis or a particular stage in the progression of sepsis. If 
the biomarker profile firom the individual contains appropriately characteristic features of the 
biomarker profile from the reference population, then the individual is diagnosed as having a 
more likely chance of becoming septic, as being afflicted with sepsis or as beiag at the 
particular stage in the progression of sepsis as the reference population. The reference 
biomarker profile may also be obtained from various populations of individuals including 
those who are suffering from SIRS or those who are suffering from an infection but who are 
not suffering from SIRS. Accordingly, the present invention allows the clinician to 
determine, inter alia, those patients who do not have SIRS, who have SIRS but axe not likely 
to develop sepsis within the time frame of the investigation, who have sepsis, or who are at 
risk of eventually becoming septic. 
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[0010] Although the methods of the present invention are particularly useful for detecting 
or predicting the onset of sepsis in SIRS patients, one of ordinary skill in the art will 
understand that the present methods may be used for any patient including, but not limited to, 
patients suspected of having SIRS or of being at any stage of sepsis. For example, a 
biological sample could be taken from a patient, and a profile of biomarkers in the sample 
could be compared to several different reference biomarker profiles, each profile derived 
fix»m individuals such as, for example, those having SIRS or being at a particular stage of 
sepsis. Classification of the patient's biomarker profile as corresponding to the profile 
derived firom a particular reference population is predictive that the patient falls within the 
reference population. Based on the diagnosis resulting from the mefliods of the present 
invention, an appropriate treatment regimen could then be initiated. 

[0011] Existing methods for the diagnosis or prediction of SIRS, sepsis or a stage in the 
progression of sepsis are based on clinical signs and symptoms that are nonspecific; 
therefore, the resulting diagnosis often has limited clinical utility. Because the metliods of 
the present invention accurately detect various stages of sepsis, they can be used to identify 
those individuals who might appropriately be enrolled in a therapeutic study. Because sepsis 
may be predicted or diagnosed from a "snapshot" of biomarker expression in a biological 
sample obtained at a single point in time, this therapeutic study may be kdtiated before the 
onset of serious clinical symptoms. Because the biological sample is assayed for its 
biomarker profile, identification of the particular biomarkers is imnecessary. Nevertheless, 
the present invention provides methods to identify specific biomarkers of the profiles that are 
characteristic of sepsis or of a particiilar stage in the progression of sepsis. Such biomarkers 
themselves will be useful tools in predicting or diagnosing sepsis. 

[0012] To that end, the present invention provides various nucleic acid biomarkers or 
combinations of nucleic acid biomarkers. The nucleic acid biomarkers are identified from a 
biological sample (e,g.^ a blood sample) obtained from an individual. Without being bound 
to or limited by theory, the levels of expression of certain mRNAs that encode for proteins 
involved in the host's response to SIRS or various stages of sepsis, including the onset of 
sepsis prior to clinical suspicion of sepsis using conventional techniques, are predictive or 
diagnostic of the stage of sepsis or are diagnostic of SIRS. Cells in the biological sample 
express mRNAs encoding these proteins, and the detection of the level of expression of these 
mRNAs may be used to determine the status of sepsis or diagnose SIRS in the individual. 
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Nucleic acid biomarkers of the invention include, but are not limited to, mRNAs, nucleic 
acids that are made from these mRNAs, and nucleic acids capable of forming a duplex with 
mRNAs. The nucleic acid biomarkers of the present invention are termed "host response 
biomarkers." 

[0013] Accordingly, the present invention also provides a kit comprising at least one, 
two, three, four, five, 10 or more host response biomarkers selected firom the group consisting 
of nucleic acids listed in any one of TABLES 2-10. The nucleic acids listed in TABLES 2 
— 10 are identified by the protein that they encode. A host response biomarker for the 
purpose of the invention includes, but is not limited to, an mRNA that encodes the identified 
protein, a cDNA made fi*om the mRNA, and a DNA or other molecule that is able to form a 
specific complex (e,g„ a hybridized duplex) wilii an mRNA that encodes the identified 
protein. In one embodiment, the kit comprises a DNA probe (e.g.^ a cDNA or 
oligonucleotide) capable of forming a duplex with an mRNA that corresponds to a biomarker 
listed in TABLES 2-10. In another embodiment, the present invention provides a method 
comprising using DNA probes to construct a nucleic acid array capable of determining the 
status of sepsis or of diagnosing SIRS in the iadividual by detecting mRNA biomarkers 
present in a biological sample fi:om the individual. 

[0014] The present invention fiirtihier provides a profile of host response biomarkers 
comprising at least two, three, four, five, 10 or 20 or more features that are meastirable 
characteristics of nucleic acids and that contribute to the classification of an iadividual in a 
reference population with an accuracy of at least about 60%, at least about 70%, at least about 
80%, at least about 90%, at least about 95%, or about 100%, The reference population is 
selected from the group consisting of a normal reference population, a SIRS-positive 
reference population, an infected/SIRS -negative reference population, a sepsis-positive 
reference population, a reference population at a particular stage ra the progression of sepsis, 
a SIRS-positive reference population confirmed as having sepsis by conventional techniques 
after about 0 — 36 hours, a SIRS-positive reference population confirmed as having sepsis by 
conventional techniques after about 36 - 60 hours, and a SIRS-positive reference population 
confirmed as having sepsis by conventional techniques after about 60 - 84 hours. 

[0015] The present invention further i>rovides a method of isolating a host response 
biomarker, where the biomarker is capable of being used in a method of determining the 
status of sepsis or diagnosing SIRS in an individual. This method comprises obtaining a 
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reference biomarker profile of host response biomarkers from biological samples obtained 
firom a population of individuals and identifying a feature of the reference biomarker profile 
capable of detenmning the status of sepsis or diagnosing SIRS in the individual. A host 
response biomarker that corresponds to the feature is then identified and isolated. 

BRIEF DESCRIFnON OF THF. TO AWTNOS 
[0016] FIGURE 1 illustrates the progression of SIRS to sepsis. The condition of sepsis 
consists of at least three stages, wdfh a septic patient progressing from severe sepsis to septic 
shock to multiple organ dysfunction. 

[0017] FIGURE 2 shows the relationship between sepsis and SIRS. The various sets 
shown in the Venn diagram correspond to populations of individuals having the indicated 
condition. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0018] The present invention allows for the rapid, sensitive, and accurate diagnosis or 
prediction of sepsis using one or more biological samples obtained from an individual at a 
single time point ("snapshot") or during the course of disease progression. Advantageously, 
sepsis may be diagnosed or predicted prior to the onset of clinical symptoms, thereby 
allowing for more effective therapeutic mterventiion. 

[0019] "Systemic inflammatory response syndrome," or "SIRS," refers to a clinical 
response to a variety of severe clinical iosults, as manifested by two or more of the following 
conditions withbi a 24-hour period: 

body temperature greater than 38°C (100.4°F) or less than 36°C (96.8°F); 

• heart rate (HR) greater than 90 beats/minute; 

• respiratory rate (RR) greater than 20 breaths/mmute, or Pco2 less than 32 mm 
Hg, or requiring mechanical ventilation; and 

• white blood cell count (WBC) either greater than 12.0x1 0^/L or less than 4,0 
X 10^/L or having greater than 10% hnmature forms (bands). 

[0020] These symptoms of SIRS represent a consensus definition of SIRS that may be 
modified or supplanted by an improved definition in the future. The present definition is 
used to clarify current clinical practice and does not represent a critical aspect of the 
invention. 
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[0021] A patient with SIRS has a clinical presentation that is classified as SIRS, as 
defined above, but is not clinically deemed to be septic. Individuals who are at risk of 
developing sepsis include patients in an ICU and those who have otherwise suffered fi-om a 
physiological trauma, such as a bum or other insult. "Sepsis" refers to a SIRS-positive 
condition that is associated with a confirmed infectious process. Clinical suspicion of sepsis 
arises firom the suspicion that the SIRS-positive condition of a SIRS patient is a result of an 
infectious process. As used herein, "sepsis" includes all stages of sepsis including, but not 
limited to, the onset of sepsis, severe sepsis and MOD associated with the end stages of ' 
sepsis. 

[0022] The "onset of sepsis" refers to an early stage of sepsis, i.e. , prior to a stage when 
the clinical manifestations are sufficient to support a clinical suspicion of sepsis. Because the 
methods of the present invention are used to detect sepsis prior to a time that sepsis would be 
suspected using conventional techniques, the patient's disease status at early sepsis can only 
be confirmed retrospectively, when the manifestation of sepsis is more clinically obvious. 
The exact mechanism by which a patient becomes septic is not a critical aspect of the 
invention. The methods of the present invention can detect changes in the biomarker profile 
independent of the origin of the infectious process. Regardless of how sepsis arises, the 
methods of the present invention allow for determining the status of a patient having, or 
stispected of having, sepsis or SIRS, as classified by previously used criteria. 

[0023] "Severe sepsis" refers to sepsis associated with organ dysfunction, hypoperfusion 
abnormalities, or sepsis-induced hypotension. Hypoperfusion abnormalities include, but are 
not limited to, lactic acidosis, oliguria, or an acute alteration in mental status. "Septic shock" 
refers to sepsis-induced hypotension that is not responsive to adequate intravenous fluid 
challenge and with manifestations of peripheral hypoperftision. A "converter patient" refers 
to a SIRS-positive patient who progresses to clinical suspicion of sepsis during the period the 
patient is monitored, typically dxrang an ICU stay. A "non-converter patient" refers to a 
SIRS-positive patient who does not progress to clinical suspicion of sepsis during the period 
the patient is monitored, typically during an ICU stay. 

[0024] A "biomarker" is virtually any biological compound, such as a protein and a 
fi-agment thereof, a peptide, a polypeptide, a proteoglycan, a glycoprotein, a lipoprotein, a 
carbohydrate, a lipid, a nucleic acid, an organic or inorganic chemical, a natural poljncner, and 
a small molecule, that is present in the biological sample and that may be isolated fiom, or 
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measured in, the biological sample. Furfhemiore, a biomarker can be the entire intact 
molecule, or it can be a portion thereof that may be partially functional or recognized, for 
example, by an antibody or other specific binding protein. A biomarker is considered to be 
informative if a measurable aspect of the biomarker is associated with a given state of the 
patient, such as a particular stage of sepsis. Such a measurable aspect may include, for 
example, the presence, absence, or concentration of the biomarker in the biological sample 
firom the individual and/or its presence as part of a profile of biomarkers. Such a measurable 
aspect of a biomarker is defined herein as a "feature." A feature may also be a ratio of two or 
more measurable aspects of biomarkers, which biomarkers may or may not be of known 
identity, for example. A ^T^iomarker profile" comprises at least two such features, where the 
features can correspond to the same or different classes of biomarkers such as, for example, a 
nucleic acid and a carbohydrate. A biomarker profile may also comprise at least three, four, 
five, 10, 20, 30 or more features. In one embodiment, a biomarker profile comprises 
hundreds, or even thousands, of features. In another embodiment, the biomarker profile 
comprises at least one measurable aspect of at least one internal standard. 

[0025] A "phenotypic change" is a detectable change in a parameter associated with a 
given state of the patient. For instance, a phenotypic change may include an increase or 
decrease of a biomarker in a bodily fluid, where the change is associated with sepsis or the 
onset of sepsis. A phenotypic change may further include a change in a detectable aspect of a 
given state of the patient that is not a change in a measurable aspect of a biomarker. For 
example, a change in phenotype may include a detectable change in body temperature, 
respiration rate, pulse, blood pressure, or other physiological parameter. Such changes can be 
determined via clinical observation and measurement using conventional techniques that are 
well-known to the skilled artisan. As used herein, "conventional techniques" are those 
techniques that classify an individual based on phenotypic changes without obtaining a 
biomarker profile according to the present invention. 

[0026] A "decision rule" is a method used to classify patients. This rule can take on one 
or more forms that are known in the art, as exemplified in Hastie et al, in "The Elements of 
Statistical Learning," Springer- Verlag (Springer, New York (2001)), herein incorporated by 
reference in its entirety. Analysis of biomarkers in the complex mixture of molecules within 
the sample generates features in a data set A decision rule may be used to act on a data set of 
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features to, inter alia^ predict the onset of sepsis, to determine the progression of sepsis, to 
diagnose sepsis, or to diagnose SIRS. 

[0027] The application of the decision rule does not require perfect classification. A 
classification may be made with at least about 90% certainty, or even more, in one 
embodiment. In other embodiments, the certainty is at least aboxit 80%, at least about 70%, 
or at least about 60%. The usefiil degree of certainty may vary, depending on the particular 
method of the present invention. "Certainty" is defined as the total number of accurately 
classified individuals divided by the total nimiber of individuals subjected to classification. 
As used herein, "certainty" means "accuracy." Classification may also be characterized by 
its "sensitivity." The "sensitivity" of classification relates to the percentage of sepsis patients 
who were correctly identified as having sepsis. "Sensitivity" is defined in the art as the 
number of true positives divided by the sum of true positives and false negatives. In contrast, 
the "specificity" of the method is defined as the percentage of patients who were correcfly 
identified as not having sepsis. That is, "specificity*' relates to the number of true negatives 
divided by the siim of true negatives and false positives. In one embodiment, the sensitivity 
and/or specificity is at least about 90%, at least about 80%, at least about 70% or at least 
about 60%. The number of features that may be used to classify an individual with adequate 
certainty is typically about four. Depending on the degree of certainty sought, however, the 
number of features may be more or less, but in all cases is at least one. In one embodiment, 
the number of features that may be used to classify an individual is optimized to allow a 
classification of an individual with high certainty. 

[0028] "Determining the status" of sepsis or SIRS in a patient encompasses classification 
of a patient's biomarker profile to (1) detect the presence of sepsis or SIRS in the patient, 
(2) predict the onset of sepsis or SIRS in the patient, or (3) measure the progression of sepsis 
in a patient "Diagnosing" sepsis or SIRS means to identify or detect sepsis or SIRS in the 
patient. Because of the greater sensitivity of the present invention to detect sepsis before an 
overtly observable clinical manifestation, the identification or detection of sepsis includes the 
detection of the onset of sepsis, as defined above. That is, "predicting the onset of sepsis" 
means to classify the patient's biomarker profile as correspondiag to the profile derived firom 
individuals who are progressing firom a particular stage of SIRS to sepsis or firom a state of 
being infected to sepsis (i,e., fiom infection to infection with concomitant SIRS). "Detecting 
the progression" or "determining the progression" of sepsis or SIRS means to classify the 
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biomarker profile of a patient who is akeady diagnosed as having sepsis or SIRS, For 
instance, classifying the biomarker profile of a patient who has been diagnosed as having 
sepsis can encompass detecting or determining the progression of the patient from sepsis to 
severe sepsis or to sepsis with MOD. 

[0029J According to the present invention, sepsis may be diagnosed or predicted by 
obtaining a profile of biomarkers from a sample obtained from an individual. As used herein, 
"obtain" means 'to come into possession of." The present invention is particularly useful in 
predictiuag and diagnosing sepsis in an individual who has an infection, or even sepsis, but 
who has not yet been diagnosed as havuig sepsis, who is suspected of having sepsis, or who 
is at risk of developiag sepsis. Li the same manner, the present invention may be used to 
detect and diagnose SIRS in an individual. That is, the present invention may be used to 
confirm a clinical suspicion of SIRS. The present invention also may be used to detect 
various stages of the sepsis process such as infection, bacteremia, sepsis, severe sepsis, septic 
shock and the like. 

[0030] The profile of biomarkers obtained from an individual, Le., the test biomarker 
profile, is compared to a reference biomarker profile. The reference biomarker profile can be 
generated from one individual or a population of two or more individuals. The population, 
for example, may comprise three, four, five, ten, 15, 20, 30, 40, 50 or more individuals. 
Furthermore, the reference biomarker profile and the individual's (test) biomarker profile that 
are compared in the methods of the present invention may be generated from the same 
individual, provided that the test and reference biomarker profiles are generated from 
biological samples taken at different time points and compared to one another. For example, a 
sample may be obtaiaed from an individual at the start of a study period. A reference 
biomarker profile taken from that sample may then be compared to biomarker profiles 
generated from subsequent samples from the same individual. Such a comparison may be 
used, for example, to determine the status of sepsis in the individual by repeated 
classifications over time. 

[0031] The reference populations may be chosen from individuals who do not have SIRS 
("SIRS-negative"), from individuals who do not have SIRS but who are suffering from an 
infectious process, from individuals who are suffering from SIRS without the presence of 
sepsis ("SIRS-positive"), from individuals who are sxiffering from the onset of sepsis, from 
individ\jals who are sepsis-positive and suffering from one of the stages in tiie progression of 
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sepsis, or from individuals with a physiological trauma that increases the risk of developing 
sepsis. Furthermore, the reference populations may be SIRS-positive and are then 
subsequently diagnosed with sepsis using conventional techniques. For example, a 
population of SIRS-positive patients used to generate the reference profile may be diagnosed 
with sepsis about 24, 48, 72, 96 or more hours after biologLcal samples were taken from them 
for the purposes of generating a reference biomarker profile. In one embodiment, the 
population of SlRS-positive individuals is diagnosed with sepsis using conventional 
techniques about 0-36 hours, about 36-60 hours, about 60-84 hours, or about 84-108 hours 
after the biological samples were taken. If the biomarker profile is indicative of sepsis or one 
of its stages of progression, a clinician may begin treatment prior to the manifestation of 
clinical symptoms of sepsis. Treatment typically will involve examining the patient to 
determine the soi3rce of the infection. Once locating the source, the clinician typically will 
obtain cultures from the site of the infection, preferably before beginning relevant empirical 
antimicrobial therapy and perhaps additional adjunctive therapeutic measures, such as 
draining an abscess or removing an infected catheter, Hierapies for sepsis are reviewed in 
Healy, supra. 

[0032] The methods of the present invention comprise comparing an individual's 
biomarker profile with a reference biomarker profile. As used herein, "comparison" includes 
any means to discern at least one difference in the individual's and the reference biomarker 
profiles. Thus, a comparison may include a visual inspection of chromatographic spectra, 
and a comparison may include arilinnetical or statistical comparisons of values assigned to 
the features of the profiles. Such statistical comparisons include, but are not limited to, 
applying a decision rule. If the biomarker profiles comprise at leeust one internal standard, the 
comparison to discem a difference in the biomarker profiles may also include features of 
these internal standards, such that features of the biomarker are correlated to features of the 
internal standards. The comparison can predict, inter alia, the chances of acquiring sepsis or 
SIRS; or the comparison can confirm the presence or absence of sepsis or SIRS; or the 
comparison can indicate the stage of sepsis at which an individual may be. 

[0033] The present invention, therefore, obviates the need to conduct time-intensive 
assays over a monitoring period, as well as the need to identify each biomarker. Although the 
invention does not require a monitoring period to classify an individual, it will be understood 
that repeated classifications of the individual, repeated snapshots, may be taken over time 
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imtil the individual is no longer at risk. Alternatively, a profile of biomarkers obtained firom 
the individual may be compared to one or more profiles of biomarkers obtained &om the 
same individual at different points in time. The artisan will appreciate that each comparison 
made in the process of repeated classifications is capable of classifying the individual as 
belonging to or not belonging to the reference population. 

[0034] Individuals having a variety of physiological conditions corresponding to the 
various stages in the progression of sepsis, firom the absence of sepsis to MOD, may be 
distinguished by a characteristic biomarker profile. As used herein, an "individual" is an 
animal, preferably a mammal, more preferably a human or non-himian primate. The teims 
"individual," "subject" and ""patient" are used interchangeably herein. The individual can be 
normal, suspected of having SIRS or sepsis, at risk of developing SIRS or sepsis, or 
confirmed as having SIRS or sepsis. While there are many known biomarkers that have been 
implicated in the progression of sepsis, not all of these markers appear in the initial, pre- 
clinical stages. The subset of biomarkers characteristic of early-stage sepsis may, in fact, be 
determined only by a retrospective analysis of samples obtained firom individuals who 
ultimately manifest clinical symptoms of sepsis. Without being bound by theory, even an 
initial pathologic infection that results in sepsis may provoke physiological changes tiiat are 
reflected in particular changes in biomarker expression. Once the characteristic biomarker 
profile of a stage of sepsis, for example, is determined, the profile of biomarkers fix)m a 
biological sample obtained firom an individual may be compared to this reference profile to 
determine whether the test subject is also at that particular stage of sepsis, 

[0035J The progression of a population from one stage of sepsis to another, or from 
normalcy (ie., a condition characterized by not having sepsis or SIRS) to sepsis or SIRS and 
vice versa, will be characterized by changes in biomarker profiles, as certain biomarkers are 
expressed at increasingly higher levels and the expression of other biomarkers becomes 
down-regulated. These changes hi biomarker profiles may reflect the progressive 
establishment of a physiological response in the reference population to infection and/or 
inflammation, for example. The skilled artisan will appreciate that the biomarker profile of 
the reference population also will change as a physiological response subsides. As stated 
above, one of the advantages of the present invention is the capability of classifying an 
individual, using a biomarker profile from a single biological sample, as having membei^hip 
in a particular population. The artisan will appreciate, however, that the determination of 
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whether a particular physiological response is becoming established or is subsiding may be 
facilitated by a subsequent classification of the individual. To this end, the present invention 
provides numerous biomarkers that both increase and decrease in level of expression as a 
physiological response to sepsis or SIRS is established or subsides. For example, an 
investigator can select a feature of an individual's biomarker profile that is known to change 
in intensity as a physiological response to sepsis becomes established. A comparison of the 
same feature in a profile fiom a subsequent biological sample from the individual can 
estabKsh whether the individual is progressing toward more severe sepsis or is progressing 
toward normalcy. 

[0036] The molecular identity of biomarkers is not essential to the invention. Indeed, the 
present invention should not be limited to biomarkers that have previously been identified. 
{See, e,g,, U.S. Patent Application Serial No. 10/400,275, filed March 26, 2003.) It is, 
tiierefore, expected that novel biomarkers will be identified that are characteristic of a given 
population of individuals, especially a population in one of the early stages of sepsis. In one 
embodiment of the present invention, a biomarker is identified and isolated. It then may be 
used to raise a specifically-binding antibody, which can facilitate biomarker detection in a 
variety of diagnostic assays. For this purpose, any immunoassay may use any antibodies, 
antibody fragment or derivative capable of binding the biomarker molecules (e.g,^ Fab, Fv, or 
scFv fragments). Such immunoassays are well-known in the art. If the biomarker is a 
protein, it may be sequenced and its encoding gene may be cloned using well-established 
techniques. 

[0037] The methods of the present invention may be employed to screen, for example, 
patients admitted to an ICU. A biological sample such as, for example, blood, is taken 
immediately upon admission. The complex mixture of proteins and other molecules within 
the blood is resolved as a profile of biomarkers. This may be accomplished through the use 
of any technique or combination of techniques that reproducibly distinguishes these 
molecules on the basis of some physical or chemical property. In one embodiment, the 
molecules are immobilized on a matrix and then are separated and distinguished by laser 
desorption/ionization time-of-flight mass spectrometry. A spectrum is created by the 
characteristic desorption pattern that reflects the mass/charge ratio of each molecule or its 
fragments. In another embodiment, biomarkers are selected from the various mRNA species 
obtained from a cellular extract, and a biomarker profile is obtained by hybridizing the 
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individual's mRNA species to an array of cDNAs. The diagnostic use of cDNA arrays is 
well-known in the art. {See, e.g.^ Zoo, et al^ Oncogene 21: 4855-4862 (2002).) In yet 
another embodiment, a profile may be obtained using a combination of protein and nucleic 
acid separation methods. 

[0038] The invention also provides kits that are useful in determining the status of sepsis 
or diagnosing SIRS in an individual. The kits of the present iavention comprise at least one 
biomarker. Specific biomarkers useful in the present invention are set forth herein. The 
biomarkers of the kit can be used to generate biomarker profiles according to the present 
invention. Generally, the biomarkers of the kit will bind, with at least some specificity, to the 
biomarker molecules contained in the biological sample fi-om which the biomarker profile is 
generated. Examples of classes of compounds of the kit include, but are not limited to, 
proteins, and fi^agments thereof, peptides, polypeptides, proteoglycans, glycoproteins, 
lipoproteins, carbohydrates, lipids, nucleic acids, organic and inorganic chemicals, and 
natural and synthetic polymers. The biomarker(s) may be part of an array, or the 
biomarker(s) may be packaged separately and/or individually. The kit may also comprise at 
least one internal standard to be used in generating the biomarker profiles of the present 
invention. Likewise, the internal standards can be any of the classes of compounds described 
above. The kits of the present invention also may contain reagents that can be used to 
detectably label biomarkers contained in the biological samples firom which the biomarker 
profiles are generated. For this purpose, the kit may comprise a set of antibodies or 
fimctional firagments thereof that specifically bind at least two, three, four, five, 10, 20 or 
more of flie biomarkers set forth in any one of the following TABLES that list biomarkers. 
The antibodies tiiemselves may be detectably labeled. The kit also may comprise a specific 
biomarker binding component, such as an aptamer. If the biomarkers comprise a nucleic 
acid, the kit may provide an oligonucleotide probe that is capable of forming a duplex with 
the biomarker or with a complementary strand of a biomarker. The oligonucleotide probe 
may be detectably labeled. 

10039] The kits of the present invention may also include pharmaceutical excipients, 
diluents and/or adjuvants when the biomarker is to be used to raise an antibody. Examples of 
pharmaceutical adjuvants include, but are not limited to, preservatives, wetting agents, 
emulsifying agents, and dispersing agents. Prevention of the action of microorganisms can 
be ensured by the inclusion of various antibacterial and antifungal agents, for example. 
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paraben, chlorobutanol, phenol sorbic acid, and tfcie like. It may also be desirable to include 
isotonic agents such as sugars, sodium chloride, and the like. Prolonged absorption of an 
injectable pharmaceutical form can be brought about by the inclusion of agents which delay 
absorption such as aluminum monostearate and gelatin. 

Generation of Biomarker Profiles 

[0040] According to one embodiment, the methods of the present invention comprise 
obtaining a projBle of biomarkers from a biological sample taken from an individual. The 
biological sample may be blood, plamsa, serum, saliva, sputum, urine, cerebral spinal fluid, 
cells, a cellular extract, a tissue sample, a tissue biopsy, a stool sample and the like. The 
reference biomarker profile may be obtained, for example, from a population of individuals 
selected from the group consisting of SIRS-negative individuals, SIRS-positive individuals, 
individuals who are suffering from the onset of sepsis and individuals who already have 
sepsis. The reference biomarker profile from individuals who already have sepsis may be 
obtaiaed at any stage in the progression of sepsis, such as infection, bacteremia, severe sepsis, 
septic shock or MOD. 

[0041] In one embodiment, a separation method may be used to create a profile of 
biomarkers, such that only a subset of biomarkers within the sample is analyzed. For 
example, the biomarkers that are analyzed in a sample may consist of mRNA species from a 
cellular extract, which has been fractionated to obtain only the nucleic acid biomarkers within 
the sample, or the biomarkers may consist of a fraction of the total complement of proteins 
withiQ the sample, which have been fractionated by chromatographic techniques. 
Alternatively, a profile of biomarkers may be created without employing a separation 
method. For example, a biological sample may be interrogated with a labeled compound that 
forms a specific complex with a biomarker in the sample, where the intensity of the label in 
the specific complex is a measxirable characteristic of the biomarker. A suitable compound 
for forming such a specific complex is a labeled antibody. In one embodiment, a biomarker 
is measured ixsing an antibody with an amplifiable nucleic acid as a label. In yet another 
embodiment, the nucleic acid label becomes amplifiable when two antibodies, each 
conjugated to one strand of a nucleic acid label, interact with the biomarker, such that the two 
nucleic acid strands form an amplifiable nucleic acid. 

[0042] In another embodiment, the biomarker profile may be derived from an assay, such 
as an array, of nucleic acids, where tiie biomarkers are the nucleic acids or complements 
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thereof. For example^ the biomarkers may be ribonucleic acids. The biomarker profile also 
may be obtained using a method selected firom the group consisting of nuclear magnetic 
resonance, nucleic acid arrays, dot blotting, slot blotting, reverse transcription amplification 
and Northern analysis. In another embodiment, the biomarker profile is detected 
immunologically by reacting antibodies, or functional firagments thereof, specific to the 
biomarkers. A functional fiiagment of an antibody is a portion of an antibody that retains at 
least some ability to bind to the antigen to which the complete antibody binds. The 
firagments, which include, but are not limited to, scFv fiagments. Fab fragments and F(ab)2 
fragments, can be recombinantly produced or enzymatically produced. In another 
embodiment, specific binding molecules other than antibodies, such as aptamers, may be 
used to bind the biomarkers. In yet another embodiment, the biomarker profile may comprise 
a measurable aspect of an infectious agent or a component thereof. In yet another 
embodiment, the biomarker profile may comprise measurable aspects of small molecules, 
which may include firagments of proteins or nucleic acids, or which may include metabolites. 

[0043] Biomarker profiles may be generated by the use of one or more separation 
methods. For example, suitable separation methods may include a mass spectrometry 
method, such as electrospray ionization mass spectrometry (ESI-MS), ESI-MS/MS, ESI- 
MS/(MS)" (n is an integer greater than zero), matrix-assisted laser desorption ionization time- 
of-flight mass spectrometry (MALDI-TOF-MS), surface-enhanced laser 
desorption/ionization thne-of-flig^it mass spectrometry (SELDI-TOF-MS), 
desorptioi}/ionization on silicon (DIOS), secondary ion mass spectrometry (SIMS), 
quadrupole time-of-flight (Q-TOF), atmospheric pressure chemical ionization mass 
spectrometry (APCI-MS), APCI-MS/MS, APC^(MS)^ atmospheric pressure photoiomzadon 
mass spectrometry (APPI-MS), APPI-MS/MS, and APPI-(MS)". Other mass spectrometry 
methods may include, inter alia, quadrupole, fourier transform mass spectrometry (FTMS) 
and ion trap. Other suitable separation methods may include chemical extraction partitioning, 
column chromatography, ion exchange chromatography, hydrophobic (reverse phase) liquid 
chromatography, isoelectric focusing, one-dimensional polyacrylamide gel electrophoresis 
(PAGE), two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) or other 
chromatography, such as thin-layer, gas or liquid chromatography, or any combination 
thereof. In one embodiment, the biological sample may be fractionated prior to application of 
the separation method. 
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[0044] Biomarker profiles also may be generated by methods that do not require physical 
separation of the biomarkers themselves. For example, nuclear magnetic resonance (NMR) 
spectroscopy may be used to resolve a profile of biomarkers firom a complex mixture of 
molecules. An analogous use of NMR to classify tumors is disclosed in Hagberg, NMR 
Biomed 11: 148-56 (1998), for example. Additional procedures include nucleic acid 
amplification technologies, which may be used to generate a profile of biomarkers without 
physical separation of individual biomarkers. (See Stordeur et al^ J, Immunol Methods 259: 
55-64 (2002) and Tan et al, Proc. NaflAcad Set USA 99: 1 1387-11392 (2002), for 
example.) 

[0045J hi one embodiment, laser desorption/ionization tune-of-flight mass spectrometry 
is used to create a profile of biomarkers where the biomarkers are proteins or proteia 
fragments that have been ionized and vaporized off an immobilizing support by incident laser 
radiation. A profile is then created by the characteristic time-of-flight for each protein, which 
depends on its mass-to-charge ("m/z") mtio. A variety of laser desorption/ionization 
techniques are known in the art. {See, e.g., Guttman et al^ Anal Chem. 73: 1252-62 (2001) 
and Wei et al. Nature 399: 243-46 (1999)0 

[0046] Laser desorption/ionization time-of-flight mass spectrometry allows the 
generation of large amounts of information in a relatively short period of time. A biological 
sample is applied to one of several varieties of a support that binds all of tiie biomarkers, or a 
subset thereof, in the sample. Cell lysates or samples are directly applied to these surfaces in 
volumes as small as 0.5 |xL, with or without prior purification or fi*actionation. The lysates or 
sample can be concentrated or diluted prior to application onto the support sxuface. Laser 
desorption/ionization is then used to generate mass spectra of the sample, or samples, in as 
little as three horns. 

[0047] In another embodiment, the total mRNA fitom a cellular extract of the individual is 
assayed, and the various mRNA species that are obtained firom the biological sample are used 
as biomarkers. Profiles may be obtained, for example, by hybridizing these mKNAs to an 
array of probes, which may comprise oligonucleotides or cDNAs, using standard methods 
known in the art. Alternatively, the mRNAs may be subjected to gel electrophoresis or 
blottii^ methods such as dot blots, slot blots or Northern analysis, all of which are known in 
the art. {See, e.g., Sambrook et al in "Molecular Cloning, 3'^ ed.," Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (2001).) mRNA profiles also may be 
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obtained by reverse transcription followed by amplification and detection of the resulting 
cDNAs, as disclosed by Stordeur et aL, supra, for example. In another embodiment, the 
profile may be obtained by using a combination of methods, such as a nucleic acid array 
combined with mass spectroscopy. 

Use of a Data Analysis Algorithm 

[0048] In one embodiment, comparison of the individxiaPs biomarker profile to a 
reference biomarker profile comprises applying a decision rule. The decision rule can 
comprise a data analysis algorithm, such as a computer pattern recognition algorithm. Other 
suitable algorithms include, but are not limited to, logistic regression or a nonparametric 
algorithm that detects differences in the distribution of feature values (e.g., a Wilcoxon 
Signed Rank Test). The decision rule may be based upon one, two, three, four, five, 10, 20 or 
more features. In one embodiment, the decision rule is based on hundreds or more of 
features. Applying the decision rule may also comprise using a classification tree algorithm. 
For example, the reference biomarker profile may comprise at least three features, where Ihe 
features are predictors in a classification tree algorithm. The data analysis algorithm predicts 
membership wilMn a population (or class) with an accuracy of at least about 60%, at least 
about 70%, at least about 80% and at least about 90%. 

[0049] Suitable algorithms are known in. the art, some of which are reviewed in Hastie et 
at, supra. Such algorithms classify complex spectra fi-om biological materials, such as a 
blood sample, to distinguish individuals as normal or as possessing biomarker expression 
levels characteristic of a particular disease state. While such algorithms may be used to 
increase the speed and efficiency of the application of the decision rule and to avoid 
investigator bias, one of ordinary skill in the art will realize that computer-based algorithms 
are not required to carry out the methods of the present invention. 

[0050] Algorithms may be applied to the comparison of biomarker profiles, regardless of 
the method that was used to generate the biomarker profile. For example, suitable algorithms 
can be applied to biomarker profiles generated using gas chromatography, as discussed in 
Harper, "Pyrolysis and GC in Polymer Analysis," Dekker, New York (1985). Further, 
Wagner et al. Anal Chem 74: 1824-35 (2002) disclose an algorithm that improves the ability 
to classify individuals based on spectra obtained by static time-of-flight secondary ion mass 
spectrometry (TOF-SIMS). Additionally, Bri^t et al, J. Microbiol Methods 48: 127-38 
(2002) disclose a method of distinguishing between bacterial strains with high certainty (79- 
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89% correct classification rates) by analysis of MALDI-TOF-MS spectra. Dalluge, 
FreseniusJ, Anal Chem, 366: 701-11 (2000) discusses the use of MALDI-TOF-MS and 
liquid chromatography-electrospray ionization mass spectrometry (LC/ESI-MS) to classify 
profiles of biomarkers in complex biological samples. 

Biomarkers 

[0051] The methods of the present invention can be carried out by generation of a 
biomarker profile that is diagnostic or predictive of sepsis or SIRS. Because profile 
generation is sufficient to carry out the invention, the biomarkers tiiat constitute the profile 
need not be known or subsequently identified. 

[0052] Collectively, endogenous biomarkers usefid in determining the status of sepsis or 
diagnosing SIRS in an individual accordmg to the methods of the present invention are herein 
defined as "host response biomarkers." Biomarkers that can be used to generate the 
biomarker profiles of the present invention may include those known to be informative of the 
state of the immime system in response to infection; however, not aU of these biomarkers 
may be equally informative. These biomarkers can include hormones, autoantibodies, 
soluble and insoluble receptors, growth factors, transcription factors, cell surface markers and 
soluble markers from the host or firom the pathogen itself, such as coat proteins, 
lipopolysaccharides (endotoxin), lipoteichoic acids, etc. Other biomarkers include, but are 
not limited to, cell-sxirface proteins such as CD64 proteins; GDI lb proteins; HLA Class II 
molecules, including HLA-DR proteins and HLA-DQ proteins; CD54 proteins; CD71 
proteins; CD86 proteins; surface-bound tumor necrosis factor receptor (TNF-R); 
pattern-recognition receptors such as TolUike receptors; soluble markers such as interleukins 
IL-1, IL-2, IL-4, IL-6, IL-8, BL-IO, IL-11, IL-12, IL-13, and IL-18; tumor necrosis factor 
alpha (TNF-a); neopterin; C-reactive protein (CRP); procalcitonin (PCT); 6-keto Fla; 
thromboxane B2; leukotrienes B4, C3, C4, C5, D4 and E4; interferon gamma (IFNy); 
interferon alpha/beta (IFN a/p); lymphotoxin alpha (LTa); complement components (C*); 
platelet activating factor (PAF); bradykinin, nitric oxide (NO); granxilocyte 
macrophage-colony stimulating factor (GM-CSF); macrophage iiJiibitory factor (MIF); 
interleukin-1 receptor antagonist (IL-lra); soluble tumor necrosis factor receptor (sTNFr); 
soluble interleukin receptors sIL-lr and sIL"2r; transforming growth factor beta (TGFp); 
prostaglandin E2 (PGE2); granulocyte-colony stimulating factor (G-CSF); and other 
inflammatory mediators, (Reviewed in Oberholzer et al^ Shock 16: 83-96 (2001) and 
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Vincent et al in "The Sepsis Text/' Carlet et al^ eds, (Kluwer Academic Publishers, 2002).) 
Biomarkers conunonly and clinically associated with bacteremia are also candidates for 
biomarkers useful for the present invention, given the conmion and frequent occurrence of 
such biomarkers in biological samples. Biomarkers can include low molecular weight 
compounds, which can be fragments of proteins or nucleic acids, or they may include 
metabolites. The presence or concentration of the low molecular weight compounds, such as 
metabolites, may reflect a phenotypic change that is associated with sepsis and/or SIRS. In 
particular, changes in tihe concentration of small molecule biomarkers may be associated with 
changes in cellular metabolism that result from any of the physiological changes in response 
to SIRS and/or sepsis, such as hypothermia or hyperthermia, increased heart rate or rate of 
respiration, tissue hypoxia, metabolic acidosis or MOD. Biomarkers may also include RNA 
and DNA molecules tiiat encode protein biomarkers. 

[0053] Biomarkers can also include at least one molecule involved in leukocyte 
modulation, such as neutrophil activation or monocyte deactivation. Increased expression of 
CD64 and CDl lb is recognized as a sign of neutrophil and monocyte activation. (Reviewed 
in Oberholzer et al, supra and Vincent et ah, supra,) Among those biomarkers that can be 
useful in the present invention are those that are associated with macrophage lysis products, 
as can markers of changes in cytokine metabolism- (See Gagnon et al. Cell 1 10; 1 19-3 1 
(2002); Oberholzer, et aL, supra; Vincent, et. aL, supra.) 

[0054] Biomarkei^ can also include signaling factors known to be involved or discovered 
to be involved in the inflammatory process. Signaling factors may initiate an intracellular 
cascade of events, including receptor binding, receptor activation, activation of intracellular 
kinases, activation of transcription factors, changes in the level of gene transcription and/or 
translation, and changes in metabolic processes, etc. The signaling molecules and the 
processes activated by these molecules collectively are defined for the purposes of the present 
invention as "biomolecules involved in the sepsis pathway." The relevant predictive 
biomarkers can include biomolecules involved in the sepsis pathway. 

[0055] Accordingly, while the methods of the present invention may use an unbiased 

i 

approach to identifying predictive biomarkers, it will be clear to the artisan that specific 
groups of biomarkers associated with physiological responses or with various signaling 
pathways may be the subject of particular attention. This is particidarly the case where 
biomarkers from a biological sample are contacted with an array that cm be vised to measure 
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the amount of various biomarkers through direct and speciJBc interaction with the biomarkers 
(e.g.^ an antibody array or a nucleic acid array). In this case, the choice of the components of 
the array may be based on a suggestion that a particular pathway is relevant to the 
determination of the status of sepsis or SIRS in an individual. The indication that a particular 
biomolecule has a feature that is predictive or diagnostic of sepsis or SIRS may give rise to 
an expectation that other biomolecules that are physiologically regulated ia a concerted 
fashion likewise may provide a predictive or diagnostic feature. Hie artisan will appreciate, 
however, that such an expectation may not be realized because of the complexity of 
biological systems. For example, if the amount of a specific mRNA biomarker were a 
predictive feature, a conceited change in mRNA expression of another biomarker might not 
be measurable, if the expression of the other biomarker was regulated at a post-translational 
level. Further, the mRNA expression level of a biomarker may be affected by multiple 
converging pathways that may or may not be involved m a physiological response to sepsis. 

[0056] Biomarkers can be obtained from any biological sample, which can be, by way of 
example and not of limitation, blood, plasma, saliva, serum, urine, cerebral spinal fluid, 
sputum, stool, cells and cellular extracts, or other biological fluid sample, tissue sample or 
tissue biopsy from a host or patient. The precise biological sample that is taken from the 
individual may vary, but the sampUng preferably is minimally invasive and is easily 
performed by conventional techniques. 

[0057] Measurement of a phenotypic change may be carried out by any conventional 
technique. Measurement of body temperature, respiration rate, pidse, blood pressure, or other 
physiological parameters can be achieved via clinical observation and measxirement. 
Measurements of biomarker molecules may include, for example, measurements that mdicate 
the presence, concentration, expression level, or any other value associated with a biomarker 
molecule. The form of detection of biomarker molecules typically depends on the method 
used to form a profile of these biomarkers from a biological sample. For instance, 
biomarkers separated by 2D-PAGE are detected by Coomassie Blue staining or by silver 
staining, which are well-estabUshed in the art. 

Isolation of Useful Biomarkers 

[0058] It is expected that useM biomarkers will include biomarkers that have not yet 
been identified or associated with a relevant physiological state. In one aspect of flie 
invention, useful biomarkers are identified as components of a biomarker profile from a 
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biological sample. Such an identification may be made by any well-known procedure in the 
art, including immunoassay or automated microsequencing. 

[0059] Once a useful biomarker has been identified, the biomarker may be isolated by 
one of many well-known isolation procedures. The invention accordingly provides a method 
of isolating a biomarker that is diagnostic or predictive of sepsis comprising obtaining a 
reference biomarker profile obtained from a population of individuals, identifying a feature of 
the reference biomarker profile that is predictive or diagnostic of sepsis or one of the stages in 
the progression of sepsis, identifying a biomarker that corresponds with that feature, and 
isolating the biomarker. Once isolated, the biomarker may be used to raise antibodies that 
bind the biomarker if it is a protein, or it may be used to develop a specific oligonucleotide 
probe, if it is a nucleic acid, for example. 

[0060] The skilled artisan will readily appreciate that useful features can be further 
characterized to determine the molecular structure of the biomarker. Methods for 
characterizing biomolecules in this fashion are well-known in the art and include high- 
resolution mass spectrometry, infrared spectrometry, ultraviolet spectrometry and nuclear 
magnetic resonance. Methods for determining the nucleotide sequence of nucleic acid 
biomarkers, the amino acid sequence of polypeptide biomarkeis, and the composition and 
sequence of carbohydrate biomarkers also are well-known in the art. 

AppMcation of the Present Invention to SIRS Patients 

[0061] In one embodiment, the presently described methods are used to screen SIRS 
patients who are particularly at risk for developing sepsis. A biological sample is taken from 
a SIRS-positive patient, and a profile of biomarkers in the sample is compared to a reference 
profile from SIRS-positive individuals who eventually progressed to sepsis. Classification of 
the patient's biomarker profile as corresponding to the reference profile of a SIRS-positive 
population that progressed to sepsis is diagnostic that the SIRS-positive patient will likewise 
progress to sepsis. A treatment regimen may then be initiated to forestall or prevent the 
progression of sepsis. 

[0062] In another embodiment, the presently described methods are used to confirm a 
clinical suspicion that a patient has SIRS, In this case, a profile of biomarkers in a sample is 
compared to reference populations of individuals who have SIRS or who do not have SIRS. 
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Classification of the patient's biomarker profile as corresponding to one population or the 
other then can be used to diagnose the individual as having SIRS or not having SIRS. 

Examples 

[0063] The following examples are representative of the embodiments encompassed by 
the present invention and in no way limit the subject embraced by the present invention. 

1.1, Biological Samples Received and Analyzed 

[0064] Reference biomarker profiles were established for two populations of patient 
volunteers. The first population ('*the SIRS group'*) represents patients who developed SIRS 
and who entered into the present study at 'T)ay but who did not progress to sepsis during 
their hospital stay. The second population Cthe sepsis group") represents patients who 
likewise developed SIRS and entered into the present study at Day 1 but who progressed to 
sepsis typically at least several days after entering the study. Blood samples were taken about 
every 24 hours from each study group- Clinical suspicion of sepsis in liie sepsis group 
occurred at 'time 0." "Time -24 hours" and 'Hime -48 hours" represent samples taken 24 
hours and 48 hours, respectively, preceding the day of clinical suspicion of the onset of sepsis 
in the sepsis group. That is, the samples from the sepsis group included those taken on the 
day of entry into the study (Day 1), 48 hours prior to clinical suspicion of sepsis (time -48 
hours), 24 hours prior to clinical suspicion of sepsis (time —24 hours), and on Ihe day of 
clinical suspicion of the onset of sepsis (time 0). 

1.2, Analysis of mRNA from the biological samples 

[0065] Whole blood samples isolated from a patient were extracted to remove mRNA 
using methods known to one of orcBnary skill hi the art. A suitable RNA isolation procedure 
is found, for example, in RNA Methodologies, A Laboratory Guide for Isolation and 
Characterization, 2"^*^ ed., R.E. Farrell, Jr., ed.. Academic Press (1998) at pp. 55-104. A 
filter-based total RNA isolation approach may be used, as described for the use with the 
RNAqueous™ system (Phenol-Free Total RNA Isolation Kit, Catalog #1912, version 9908; 
Austin, Texas). The procedures used for RNA isolation and subsequent manipulations are 
further described in WO 03/040404, which is assigned to Source Precision Medicine. 

[0066] Once isolated, selected RNA species were amplified using message specific 
primers or random primers. Specific primers were designed firom data obtained fix>m public 
databases, such as Unigene (National Center for Biotechnology Information, National Library 
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of Medicine, Bethesda, Maryland), Primers were designed to amplify specific RNA 
sequences present in the sample using RT-PCR, using principles generally known by one of 
ordinary skm in the art. WO 03/040404, page 22, lines 24-32.) RNA sequences may be 
amplified using either an isoliiermic or thermic cycler, such as with an ABI9600, 9700, or 
7700 cycler (Applied Biosystems, Foster City, California.) Amplified RNAs may then be 
detected using, for example, fluorescent-tagged detection primers, as described in the 
TaqMan™ system (PGR Reagent Kit, Protocol, part number 402823, revision A (1996), 
Applied Biosystems, Foster City, California). RNA detection and quantification may be 
performed by techniques well-known in the art. (See, e.g., WO 03/040404 at page 23, lines 
5-13.) 

[0067] Following RNA isolation firom a patient's blood and purification of the RNA, the 
following protocol was used to amplify the RNA and react it with a 72-member gene 
expression array having the nucleic acid probes set forth in TABLE 1 : 

Materials 

L Applied Biosystems TaqMan Reverse Transcription Reagents Kit (P/N 808- 
0234). Kit Components: lOX TaqMan RT Buffer, 25 mM Magnesium chloride, 
deoxyNTP mixture. Random Hexamers, RNase Inhibitor, MidtiScribe Reverse 
Transcriptase (50 U/mL), and RNase/DNase free water (DEPC-Treated Water from 
Ambion product number 9915G), or equivalent). 

Methods 

2. RNA samples were removed from the -SO^'C freezer, lhawed at room 
temperatm-e and placed immediately on ice. 

3. The following cocktail of Reverse Transcriptase Reagents was prepared for 
each RT reaction: 

per reaction (mD 



lOXRT Buffer 


10.0 


25mMMgC12 


22.0 


dNTPs 


20.0 


Raodom Hexamers 


5.0 


KNAse Inhibitor 


2.0 


Reverse Transcriptase 


2.5 
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Water 18.5 

Total: 80.0 mL 

4. Each RNA sample was brought up to a total volume of 20 mL m a 1.5 mL 
microcentrifuge tube and 80 mL RT reaction mix were added. The solution was 
mixed by pipetting up and down. 

5- The sample was incubated at room temperature for 10 minutes, 

6. The sample was then incubated at 37°C for 1 hour. 

7. The sample was finally incubated at 90®C for 10 minutes, 

8. The samples were spun for a short time in a microcentrifuge. 

9. The samples were then placed on ice if PGR was to be performed 
immediately. Otherwise, the samples were stored at — 20*'C for future use. 

10. PGR quality control was run on all RT samples using 18S rRNA and p-actin 
mKNA as controls. 

The use of the primer probe with the first strand cDNA was as described 
above. Measurement of amplified RNAs that bound to tiie 72-member gene 
expression array was performed according to the following procedure: 

Materials 

1. 20X Primer/Probe Mix for each gene of interest; 20X Primer/Probe Mix for 
18S endogenous control; 2X TaqMan Universal PGR Master Mix; cDNA transcribed 
from RNA extracted from blood samples; Applied Biosystems 96- Well Optical 
Reaction Plates; Applied Biosystems Optical Caps, or optical-clear film; and 
Applied Biosystem Prism 7700 Sequence Detector. 

Methods 

2. Stocks of each Primer/Probe mix were made containing the Primer/Probe for 
the gene of interest, Primer/Probe for 18S endogenous control, and 2X PGR Master 
Mix, The following example illustrates a typical set up for one gene with 
quadruplicate samples testing two conditions (2 plates). 
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IX rperwem 
2X Master Mix 12.50 
20X 1 8S Primer/Probe Mix 1,25 
20X Gene of interest Primer/Probe Mix L25 
Total 15.00 iiL 

3. Stocks of cDNA targets were made by diluting 95 jxL of cDNA into 2000 pL 
of water. The amoimt of cDNA was adjusted to give Ct values between 10 and 18, 
typically between 12 and 13. 

3. 15 |jL of Primer/Probe mix were pipetted into the appropriate wells of an 
Applied Biosystems 96- Well Optical Reaction Plate. 

4, 10 pL of cDNA stock solution were pipetted into each well of the Applied 
Biosystems 96-Well Optical Reaction Plate. 

5, The plate was sealed with Applied Biosystems Optical Caps, or optical-clear 
fihn. 

6. The plate then was analyzed using an AB Prism 7700 Sequence Detector. 

[0068] When using a TaqMan format, the number of amplification cycles required to 
produce a threshold level of fluorescence provides a setni-quantitative estimate of the amoimt 
of mRNA corresponding to each gene that was in the sample. When conducted in this 
fashion, the average coefficient of variation (SD/average x 100) of measurement is typically 
less than 5%, and may be less than 2%. A method of quantifying real-time PGR 
amplification is described in Hira5rama et al^ Blood 92: 46-52 (1998), for example. 
Additional aspects of quantitative real-time amplification are described in WO 03/040404 at 
pages 20-23, for example, and variations of such methods are well-known in the art. 

1,3 Data Analysis and Results 

[0069] For each sample, the concentrations of the mRNAs that boimd 72 different cDNA 
probes on the expression array were measured. In this example, each mRNA is a biomarker, 
and the concentration of each in the sample can be a feature of that biomarker. The 
concentration of the biomarkers was determined by their ability to form specific duplexes 
with the various cDNA probes of the array. The cDNA probes of the array correspond to 
genes that encode proteins that include various cj^okines, chemokines or growth factors, cell 
markers, proteinases or proteinase inhibitors, receptors, transcription regulators, and 
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enzymes, as shown in TABLE 1 . The probes indicated with the asterisks are present on the 
array described at pages 46 - 52 of WO 03/040404. 

TABLE 1 



Marker 


Description 


Marker 


Description 


APAFl 


Apoptotic Protease Activating Factor 
1 


ILIA* 


Interleukin 1, alpha 


ARG2 


Extra-Hepatic Arginase 


ILIB* 


Ihterleukin 1, beta 


BPI 


Bactericidal/permeability-increasing 
protein 


ILlRi 


Interleukin 1 receptor, type I 


CIQA* 


Complement component 1, q 
subcomponent, alpha polypeptide 


ILIRN* 


hiterleukin 1 receptor antagonist 


CALCA 


Calcitonin/Calcitonin Gene Related 
Peptide 


IL2 * 


Ihterleukin 2 


CASPl 


Caspase 1 


IL4* 


Interleukin 4 


CASP3 


Caspase 3 


IL5 * 


Ihterleukin 5 


CCL3 


Chemokine (CC-motif) ligand 3 


IL6* 


Interleukin 6 (interferon, beta 2) 


CCRl 


Chemokine (COmotif) receptor 1 


IL8* 


Interleukin 8 


CCR3 


Chemokine (CC-motif) receptor 3 


rroAM 


Integrin, AIpha-M 


CD14* 


CD14 antigen 


JUN 


v-jun avian sarcoma virus 17 

oncogene homolog 


CD19* 


CD19 antigen 


LBP 


Lipopolysaccharide-binding protein 


CD4 * 


CD4 antigen 


MBL2 


Mannose-binding lectin 2 


CD86 


CD86 antigen 


ME? 


Macrophage Migration Bnthibitory 
Factor 


CD8A* 


CDS antigen, alpha polypeptide 


MMP9 * 


Matrix metalloproteinase 9 
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CRP 


C-reactive protein 


NFKBl 


Nuclear factor of kappa light 
polypeptide gene enhancer in B-cells 
1 (pl05) 


CSF2* 


Granulocyte-monocyte colony 
stimulating factor 


NFKBIB 


Nuclear factor of kappa light 
polypeptide gene enhancer in B-cells 
inhibitor, beta 


CSF3 * 


Colony stimulating &ctor 3 


NOSl 


Nitric oxide syntiiase 1 (neuronal) 


CXCLIO 


Chemokine (CXC motif) ligand 10 


NOS3 


Nitric oxide-synthase 3 (endothelial) 


DTR 


Diphtheria toxin receptor (heparin- 
binding epidermal growth &ctor-like 
growth factor) 


PLA2G7 * 


Phospholipase A2, group VU (platelet 
activating factor acetylhydrolase, 

plasma) 


ELA2 


Neutrophil elastase 


PLAU* 


Plasminogen activator, urokinase 


F3* 


Coagulation factor ni 


SERPINEl * 


Serine (or cysteine) protease inhibitor, 
clade B (ovalbumin), member 1 


FCGRIA 


Fc fragment of IgG, high affinity lA 
receptor 


SFTPD 


Sur&ctant, pulmonary-associated 
protein D 


FTL 


Ferritin, light polypeptide 


STAT3 


Signal transduction and activator of 
transcription 3 


GZMB 


Granz3nTie B 


TGFBl * 


Transforming growth factor-betal 


HMOXl * 


Heme oxygenase (decycling) 1 


TGFBR2 


Transforming growdi factor-beta 
receptor 2 


HSPAIA * 


Heat shock protein 70 


TIMPl * 


Tissue inhibitor of metalloproteinase 
1 


ICAMl * 


Intercellular adhesion molecule 1 


TLR2 


Toll-like receptor 2 


IFI16 


Gamma interferon inducible protein 
16 


TLR4 


Toll-like receptor 4 


IFNA2* 


Interferon, alpha 2 


TNF* 


Tumor necrosis fector, alpha 
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IFNG* 


Interferon, gamma 


TNFRSF13B 


Tumor necrosis factor receptor 
superfamily, member 13b 


ILIO* 


Interleukin 10 


TNFSF13B * 


Tumor necrosis fector (ligand) 
super&mlly, member 13b 


IL12B* 


Interleukin 12 p40 


TNFSF5* 


Tumor necrosis fector (ligand) 
super&nily, member 6 


IL13 * 


Interleukin 13 


TNFSF6* 


Tumor necrosis factor (ligand) 
super&mily, member 6 


IL18* 


Interleukin 18 


TREMl 


Triggering receptor expressed on 
myeloid cells 1 


IL18R1 * 


Interleukin 18 receptor 1 


VEGF* 


Vascular endothelial growtli factor 



1 .3.1 ■ Cross-Validation 
[0070] Various approaches may used to identify features that can inform a decision rule 
to classify individuals into the SIRS or sepsis groups, which are described below. A selection 
bias can affect the identification of features that inform a decision rule, when the decision 
rule is based on a large number of features from relatively few biomarker profiles. (See 
Ambroise etaL, Proa Nat '1 Acad Sci. USA 99: 6562-66 (2002).) Selection bias may occur 
when data are used to select features, and performance then is estimated conditioned on the 
selected features with no consideration made for the variability in the selection process. The 
resxilt is an overestimadon of the classification accuracy. Without compensation for selection 
bias, classification accuracies may reach 100%, even when the decision rule is based on 
random input parameters. (Id,) Selection bias may be avoided by including feature selection 
in the performance estimation process, whether that performance estimation process is 10- 
fold cross-validation or a type of bootstrap procedure, (See, e.g., Hastie et aly supra, at 7,10 
— 7. 1 1 , herein incorporated by reference.) 

[0071] In one embodiment of the present invention, model performance is measured by 
ten-fold cross-validation. Ten-fold cross-validation proceeds by randomly partitioning the 
data into ten exclusive groups. Each group in turn is excluded, and a model is fitted to the 
remaining nine groups. The fitted model is applied to the excluded group, and predicted class 
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probabilities are generated. The predicted class probabilities can be compared to the actual 



class memberships by simply generating predicted classes. For example, if the probability of 
sepsis is, say, greater than 0.5, the predicted class is sepsis. 

[0072] Deviance is a measure comparing probabilities with actual outcomes. As used 
herein, "deviance" is defined as: 



where P is the class probability for the specified class. Deviance is minimized when class 
probabilities are high for the actual classes. Two models can make the same predictions for 
given data, yet a preferred model would have a smaller predictive deviance. For each of the 
10 iterations in the ten-fold cross-validation, the predicted deviance is calculated for the cases 
left out of the model fitting during that iteration. The result is 10 unbiased deviances. 
Typically, these 10 deviances are summed to create a general summary of model performance 
(i.e., accuracy) on the total data set. Because in fact 10 different models were fit, cross- 
validation does not prove the performance of a specific model. Rather, the 10 models were 
generated by a common modeling process, and cross-validation proved the performance of 
this process. An eleventh model arising fi-om this process will likely have predictive 
performance similar to those of the first 10. Use of a ten-fold cross-validation typically 
results in a model performance of less than 100%, but the performance obtained after ten-fold 
cross-validation is expected to reflect more closely a biologically meaningfiil predictive 
accuracy of the decision rule, when applied to biomarker profiles obtained from samples 
outside of the training set. 

1,3.2, Classification Tree Analysis 
[0073] One approach to analyze this data is to use a classification tree algorithm that 
searches for patterns and relationships in large datasets, A "classification tree" is a recursive 
partition to classify a partictilar patient into a specific class (e.g., sepsis or SIRS) using a 
series of questions that are designed to accurately place the patient into one of the classes. 
Each question asks whether a patient's condition satisfies a given predictor, with each answer 
being used to guide the user down the classification tree until a class into which the patient 
falls can be determined. As used herein, a "predictor" is a range of values for a feature. In 
this Example, the feature is a concentration of a nucleic acid biomarker. The nucleic acid 
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biomarkers may be selected from those listed in any one of TABLES 2—10, but the skilled 
artisan will understand that other nucleic acid biomarkers may be usefiil for the invention. 
The "condition'* is the single, specific value of the feature that is measured in the individual's 
biomarker profile. In this example, the "class names" are sepsis and SIRS- Thus, the 
classification tree user first will ask if a first nucleic acid biomarker concentration measured 
in the individual's biomarker profile falls within a given range of the first feature's predictive 
range. The answer to the first question may be dispositive in determining if the individual 
has SIRS or sepsis. On the other hand, the answer to the first question may fijrther direct the 
user to ask if a second nucleic acid biomarker concentration measured in the individual's 
biomarker profile falls within a given range of the second feature's predictive range. Again, 
the answer to the second question may be dispositive or may direct the user fiarther down the 
classification tree until a patient classification is ultimately determined. 

1.3,3, Multiple Additive Regression Trees 
[0074] An automated, flexible modeling technique that uses multiple additive regression 
trees (MART) was used to classify sets of features as belonging to one of two populations. A 
MART model uses an ioitial offset, which specifies a constant that applies to all predictions, 
followed by a series of regression trees. Its fitting is specified by the number of decision 
points in each tree, the number of trees to fit, and a "granularity constant" that specifies how 
radically a particular tree can influence the MART model. For each iteration, a regression 
tree is fitted to estimate the direction of steepest descent of the fitting criterion. A step having 
a length specified by tiie granularity constant is taken in that direction. The MART model 
then consists of the initial offset plus the step provided by the regression tree. The 
diflferences between the observed and predicted values are recalculated, and the cycle 
proceeds again, leading to a progressive refinement of the prediction. The process continues 
either for a predetermined number of cycles or until some stopping rule is triggered. 

[0075] The number of splits in each tree is a particularly meaningful fitting parameter. If 
each tree has only one split, the model looks only at one feature and has no capability for 
combining two predictors. If each tree has two splits, the model can accommodate two-way 
interactions among features. With three trees, the model can accommodate three-way 
interactions, and so fortii. 

[0076] The value of sets of features in predicting class status was determined for data sets 
with features and known class status (e.g., sepsis or SIRS), MART provides a measure of the 
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contribution or importance of individvial features to the classification decision rule. 
Specifically, the degree to which a single feature contributes to the decision rule upon its 
selection at a given tree split can be measured to provide a ranking of features by their 
importance in determining the final decision rule. Repeating the MART analysis on the same 
data set may yield a sli^tly different ranking of features, especially with respect to those 
features that are less important in establishing the decision rule. Sets of predictive features 
and their corresponding biomarkers useful in the present invention, therefore, may vary 
slightly from those set forth herein. 

[0077] One implementation of the MART technology is found in a module, or "package/' 
for the R statistical programming environment {see Venables et al , in Modern Applied 
Statistics with S, 4* ed. (Springer, 2002); www.r-project.org). Results reported in this 
document were calculated using R versions 1 ,7.0 and L7. 1 . The module implementing 
MART, written by Dr. Greg Ridgeway, is called "gbm" and is also fi:eely available for 
download {see www.r-project.org). The MART algorithm is amenable to ten-fold cross- 
validation. The granularity parameter was set to 0.05, and the gbm package's internal 
stopping rule was based on leaving out 20% of the data cases at each marked iteration. The 
degree of interaction was set to one, so no interactions among features were considered. The 
gbm package estimates the relative importance of each feature on a percentage basis, which 
cumulatively equals 100% for all the features of the biomarker profile. The features with 
highest importance, which together account for at least 90% of total importance, are reported 
as potentially having predictive value. Note that the stopping rule in the fitting of every 
MART model contributes a stochastic component to model fitting and feature selection. 
Consequently, multiple MART modeling runs based on the same data may choose slightly, or 
possibly even completely, different sets of features. Such different sets convey the same 
predictive information; therefore, all the sets are usefid hi the present invention. Fitting 
MART models a sufficient number of times is expected to produce all the possible sets of 
predictive features within a biomarker profile. Accordingly, the disclosed sets of predictors 
are merely representative of those sets of features that can be used to classify individuals into 
populations, 

[0078] Data firom nucleic acid biomarker profiles obtained firom various samples were 
analyzed xising MART, as described above. In this analysis, the time 0 hours sepsis 
population consisted of 23 patients and the SIRS population consisted of 24 patients, while 
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the time —24 hours aad time —48 hotirs populations consisted of 24 and 21 individuals with 
sepsis and SIRS, respectively. Features corresponding to all 72 of the biomarkers listed in 
TABLE 1 were analyzed. 

[0079] For the time 0 hours populations, the fitted model included 23 trees, and the model 
allowed no interactions among the features. Using ten-fold cross-validation, the model 
correctly classified 19 of 24 SIRS patients and 15 of 23 sepsis patients, giving a model 
sensitivity of 65% and a specificity of 79%. The biomarkers are ranked in order of 
importance, as determined by the model, in TABLE 2. All features with zero importance are 
excluded. Markers indicated with a sign of "-1" were expressed at progressively higher 
levels in sepsis-positive populations as sepsis progressed, while those biomarkers with a sign 
of"!" were expressed at progressively lower levels. 



TABLE 2 

feature importance by MART analysis: time 0 hours samples 





Biomarker 


Importance 


Sign 


1 


FCGRIA 


23.769 




2 


ARG2 


20.121 




3 


CD86 


19.466 




4 


IL18R1 


10.190 




5 


MMP9 


9.1892 




6 


CD4 


9.0302 




7 


ILIB 


4.7283 




8 


IL8 


1.9926 




9 


TLA 


1.5133 





[0080] For flie time -24 hours populations, the fitted model included 12 trees, and the 
model allowed no interactions among the features. Using ten-fold cross-validation, the model 
correctly classified 15 of 21 SIRS patients and 17 of 24 sepsis patients, giving a model 
sensitivity of 71% and a specificity of 71%. The biomarkers are ranked in order of 
importance, as determined by the model, in TABLE 3. 
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TABLE 3 

feature importance by MART analysis: time -24 hours samples 





Biomarker 


Importance 




1 


FCGRIA 


60.650 


-1 


2 


ARG2 


15.188 


-1 


3 


CD4 


7.3189 


1 


4 


IL8 


6.0636 


1 


5 


TLR4 


5.3904 


-1 


6 


CSF2 


5.3896 


1 



[0081] For the time -48 hours populations, the fitted model included nine trees, and the 
model allowed no interactions among the features. Using ten-fold cross-validation, the model 
correctly classified 8 of 21 SIRS patients and 20 of 24 sepsis patients, giving a model 
sensitivity of 83%, a specificity of 38%, and an accuracy of 62%. The biomarkers are ranked 
in order of importance, as determined by the model, in TABLE 4. 



TABLE 4 

feature importance by MART ana^sis: time -48 hours samples 





Biomarker 


Importance 




1 


CD4 


49.232 


1 


2 


ARG2 


18.450 


-1 


3 


MMP9 


13.778 


-1 


4 


HSPAIA 


10.662 


-1 


5 


LBP 


7.8786 


1 



13,4. Logistic Regression Analysis 
[0082] Logistic regression provides yet another means of analyzing a data stream from 
the analysis described above- "Signal intensity" is equivalent to a concentration of a 
particular nucleic acid biomarker. The absence of a signal for a given nucleic acid biomarker 
results in an assigned signal intensify of "0." The standard deviations (SD) of the signal 
intensities from a given nucleic acid biomarker are liien obtained firom the profiles of the 
combined SIRS and sepsis populations. If there is no variation in signal intensity between 
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SIRS and sepsis popiilations (z.^., the SD = 0), the signal intensity is not considered fijrther. 
Before regression analysis, signal intensities are scaled, using methods well known in the art 
Scaling algorithms are generally described in Hastie et al^ supra^ at Chapter 1 1. 

13.5, Wilcoxon Signed Rank Test Analysis 

[0084] In yet another method, a nonparametric test such as a Wilcoxon Signed Rank Test 
can be used to identify individual biomarkers of interest. The features in a biomarker profile 
are assigned a "p-value," A?v*ich indicates the degree of certainty with which the biomarker 
can be used to classify individuals as belonging to a particular reference population. 
Generally, a p-value having predictive value is lower than about 0.05. Biomarkers having a 
low p-value can be used by "diemselves to classify individuals. Alternatively, combinations 
of two or more biomarkers can be used to classify individuals, where the combinations are 
chosen on the basis of the relative p-value of a biomarker. In general, those biomarkers with 
lower p-values are preferred for a given combination of biomarkers. Combinations of at least 
three, four, five, six, 10, 20 or 30 or more biomarkers also can be used to classify individuals 
in this manner. The artisan will understand that the relative p-value of any given biomarker 
may vary, depending on the size of the reference population. 

[0085] Using the Wilcoxon Signed Rank Test, biomarkers that formed specific duplexes 
with (?.e., hybridized to) an expression array having the probes listed in TABLE 1 were 
assigned a p-value by comparison of sepsis and SIRS populations at a given time. That is, p- 
values were sussigned to features from biomarker projBles obtained from biological samples 
taken at time 0 hours, time -24 hours, and time -48 hours. These p-values are listed in 
TABLES 5, 6 and 7, respectively. For this analysis, die sepsis and SIRS populations at 
time 0 (TABLE 5) constituted 23 and 24 i)atients, respectively; and the sepsis and SIRS 
populations at tune -24 hours and time -48 hours (TABLES 6 and 7) constituted 24 and 21 
patients, respectively. 



TABLES 

p-values for features: time 0 hours 





Biomarker 


P-Value 


1 


FCGRIA 


2.2792e-06 


2 


CD4 


6.1183e-06 


3 


IL18R1 


3.0476e-05 
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4 


CD86 


8.8376e-05 


5 


ARG2 


2.3979e-04 


6 


ILIRN 


3.8982e-04 


7 


MMP9 


5.1390e-04 


8 


PLA2G7 


7.1485e-04 


9 


IGAM 


8.6695e-04 


10 


NFSF5 


9.2451e-04 


11 


HSPAIA 


2.4353e-03 


12 


LR4 


3.6131e-03 


13 


NFSF13B 


3.6140e-03 


14 


NOS3 


4.0315e-03 


15 


BLIO 


4.8175e-03 


16 


CCRl 


5.1829e-03 


17 


IFI16 


5.4665e-03 


18 


NFSF6 


6.1913e-03 


19 


IMPl 


7.3280e-03 


20 


ILIRI 


8.8140e-03 


21 


GFBl 


1.1998e-02 


22 


ILIB 


1.7946e-02 


23 


ICAMl 


2.7022e-02 


24 


CD8A 


2.9415e-02 


25 


LR2 


3.3769e-02 


26 


CCR3 


3.71456-02 


27 


GFBR2 


3,9687e-02 


28 


SERPINEl 


4.3568e-02 




TABLE 6 




p-values for featureis: time —24 hours 




Biomarker 


P-Value 


1 


FCGRIA 


3.9756e-06 


2 


CD4 


1.4742e-03 


3 


IL18R1 


1.6002e-03 


4 


ILIO 


6.5097e-03 



36 



wo 2004/043236 



PCT/US2003/036025 



5 


PLA2G7 


8.1124e-03 


6 


ARG2 


8.6978e-03 


7 


IMPl 


9.0121e-03 


8 


NFSF13B 


1.0058e-02 


9 


LR4 


1.5071e-02 


10 


CCRl 


1.7184e-02 


11 


ILIRN 


2.1489e-02 


12 


IGAM 


3.0112e-02 


13 


TT ID 1 




14 


MMP9 


3.38946-02 


15 


NFSF5 


4.7616e-02 




TABLE? 




p-values for features: time -48 hours 




Biomarker 


P-Value 


1 


CD4 


3.2310e-04 


2 


IL18R1 


2.1669e-03 


3 


ARG2 


2.6612e-03 


4 


TIMPl 


7.8084e-03 


5 


MMP9 


1.0743e-02 


6 


PLA2G7 


1.1513e-02 


7 


FCGRIA 


1.2753e-02 


8 


TNFSF6 


2.5873e-02 


9 


ILIRI 


4.0306e-02 


10 


BPI 


4.1490e-02 



[0086] A nonpaxametiic test (e,^., a WUcoxon Signed Rank Test) alternatively can be 
used to find p-values for features that are based on the progressive appearance or 
disappearance of the feature in populations that are progressing toward sepsis. In this form of 
the test, a baseline value for a given feature first is measured, using the data fix>m the time of 
entry into the study (Day 1 samples) for the sepsis and SIRS groups. The feature intensity in 
sepsis and SIRS samples is then compared in, for example, time -48 hour samples, to 
determine whether the feature intensity has increased or decreased from its baseline value. 
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Finally, p-values are assigned to the difference from baseline in a feature intensity in the 
sepsis popidations versus the SIRS populations. The following p-valnes, listed in TABLES 8 
— 10, were obtained when measuring these differences from baseline in p-values. 



TABLE 8 

p-values for features differenced from baseline: time 0 hours 





Biomarker 


Jr-Value 


1 


FGGRIA 


2.670 le-05 


2 


ILIRN 


l.o453e-03 


3 


IL18R1 


3.2954e-03 


4 


MMP9 


5.5538e-03 


5 


IGAM 


5.862o©-03 


6 


ILIB 




7 


LR2 


8.9614e-03 


8 


LR4 


9.8975e-03 


9 


CD4 


1.1616e-02 


10 


CCRl 


1.1619e-02 


11 


NFSF13B 


1.2183e-02 


12 


PLA2G7 


1.3831e-02 


13 


CD86 


1.7683e-02 


14 


ILIO 


2.8201e-02 


15 


CCR3 


3.3319e-02 


16 


ICAMl 


3.3319e-02 



TABLE 9 

p-values for features differenced from baseline: time -24 hours 





Biomarker 


P-Value 


1 


FCGRIA 


2.3809e-04 


2 


IL18R1 


1.7692e-02 


3 


SFPD 


2.3309e-02 


4 


LR4 


3.0107e-02 


5 


NFSF13B 


4.3896e-02 
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TABLE 10 

p-values for features differenced from baseline: time -48 hours 





Biomarker 


P-Value 


1 


ARG2 


1.2741e-02 


2 


LBP 


3.8573e-02 



[0087] Having now ftiUy described the invention with reference to certain representative 
embodiments and details, it will be apparent to one of ordinary skill in the art that changes 
and modifications can be made thereto without departing from the spirit or scope of the 
invention as set forth herein. 
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What is claimed is: 

1. A kit, comprising at least two nucleic acid host response biomarkers selected from the 
group of nucleic acids set forth in any one of TABLES 2—10 and their complements. 

2. The kit of claim 1, wherein the at least two host response biomarkers are selected from 
the group of nucleic acids set forth in TABLE 2 and their complements, 

3. The kit of claim 1, wherein the at least two host response biomarkers are selected from 
the group of nucleic acids set forth in TABLE 3 and their complements. 

4. The kit of claim 1 , wherein the at least two host response biomarkers are selected from 
the group of nucleic acids set forth in TABLE 4 and their complements, 

5. The kit of claim 1, wherein the at least two host response biomarkers are selected from 
the group of nucleic acids set forth in TABLE 5 and their complements. 

6. The kit of claim 1, wherein the at least two host response biomarkers are selected from 
the group of nucleic acids set forth in TABLE 6 and their complements. 

7. The kit of claim 1, wherein the at least two host re^onse biomarkers are selected from 
the group of nucleic acids set forth in TABLE 7 and their complements. 

8. The kit of claim 1, wherein the at least two host response biomarkers are selected from 
the group of nucleic acids set forth in TABLE 8 and their complements. 

9. The kit of claim 1, wherein the at least two host response biomarkers are selected from 
the group of nucleic acids set forth in TABLE 9 and their complements. 

10. The kit of claim 1, wherein the at least two host response biomarkers are selected from 
the group of nucleic acids set forth in TABLE 10 and their complements. 
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11. A kit, comprising at least two oligonucleotide probes capable of hybridizing to at least 
two host response biomarkers selected from the group of nucleic acids set forth in any one of 
TABLES 2-10. 

12. A biomarker profile, comprising at least two features that contribute to the prediction of 
the inclusion of an individual in a reference population with an accuracy of at least about 60%, 
wherein the features are measurable characteristics of a nucleic acid, and wherein the reference 
population is selected from the group consisting of a normal reference population, a SIRS- 
positive reference population, an infected/SIRS-negative reference population, a sepsis-positive 
reference population, a reference population at a stage in the progression of sepsis, a SIRS- 
positive reference population confirmed as having sepsis by conventional techniques after about 
0-36 hours, a SIRS-positive reference population confirmed as having sepsis by conventional 
techniques after about 36 - 60 hours, and a SlRS-positive reference population confirmed as 
having sepsis by conventional techniques after about 60 - 84 hours. 

13. A method of isolating a host response biomarker, comprising: 

(a) obtaining a reference biomarker profile from a population of individuals; 

(b) identifying a feature of said reference biomarker profile lhat is predictive or 
diagnostic of sepsis or one of the stages of sepsis, wherein the feature corresponds 
to a nucleic acid; 

(c) identifying a host response biomarker that corresponds with said feature; and 

(d) isolating said host response biomarker. 
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FIGURE 1 
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FIGURE 2 



